Learning Throttle Valve Control Using Policy Search

Authors

  • Bastian Bischoff
  • Duy Nguyen-Tuong
  • Torsten Koller
  • Heiner Markert
  • Alois Knoll
Abstract

The throttle valve is a technical device used for regulating a fluid or gas flow. Throttle valve control is a challenging task due to the valve's complex dynamics and the demanding constraints placed on the controller. State-of-the-art throttle valve control, such as model-free PID control, requires time-consuming manual tuning of the controller. In this paper, we investigate how reinforcement learning (RL) can reduce the effort of manual controller design by automatically learning a control policy from experience. To obtain a valid control policy for the throttle valve, several constraints need to be addressed, such as avoiding overshoot. Furthermore, the learned controller must be able to follow given desired trajectories while moving the valve from any start position to any goal position; thus, multi-target policy learning needs to be considered for RL. In this study, we employ a policy search RL approach, Pilco [2], to learn a throttle valve control policy. We adapt the Pilco algorithm to take into account the practical requirements and constraints for the controller. For evaluation, we apply the resulting algorithm to several control tasks in simulation as well as on a physical throttle valve system. The results show that policy search RL is able to learn a consistent control policy for complex, real-world systems.
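
The no-overshoot requirement and the "any start to any goal" setting can be made concrete with a small sketch. The code below is only an illustration under stated assumptions, not the formulation used in the paper: the function names `max_overshoot` and `rollout_cost`, the quadratic tracking term, and the weight `overshoot_weight` are all hypothetical. It scores a recorded sequence of valve positions by how far the valve travels past the goal in its direction of motion, so the same measure applies to opening and closing moves.

```python
from typing import Sequence


def max_overshoot(positions: Sequence[float], start: float, goal: float) -> float:
    """Largest distance travelled past the goal, measured in the direction of motion.

    Hypothetical helper: returns 0.0 when the valve never passes the goal.
    """
    # +1 for an opening move (goal above start), -1 for a closing move.
    direction = 1.0 if goal >= start else -1.0
    return max(0.0, max(direction * (p - goal) for p in positions))


def rollout_cost(
    positions: Sequence[float],
    start: float,
    goal: float,
    overshoot_weight: float = 10.0,  # assumed value, chosen only for illustration
) -> float:
    """Mean squared distance to the goal plus a weighted overshoot penalty."""
    tracking = sum((p - goal) ** 2 for p in positions) / len(positions)
    return tracking + overshoot_weight * max_overshoot(positions, start, goal)


if __name__ == "__main__":
    smooth = [0.0, 5.0, 9.0, 10.0]       # approaches the goal without passing it
    overshooting = [0.0, 6.0, 11.0, 10.0]  # passes the goal by 1.0 before settling
    print(rollout_cost(smooth, start=0.0, goal=10.0))
    print(rollout_cost(overshooting, start=0.0, goal=10.0))
```

With these assumed weights, the overshooting trajectory receives the higher cost even though it reaches the goal region faster, which is the kind of trade-off a no-overshoot constraint is meant to enforce.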

Related articles

A Study of the Mechanism and Characteristics of a PWM-based Valve Control System with No Throttle Loss

To address the low efficiency of conventional valve control systems, an economical PWM-based valve control system with no throttle loss is proposed and examined in this paper. Four no-throttling-loss units are used to control a single-rod cylinder in this fixed-displacement pump system, which breaks the mechanical linkage of meter-in and meter-out flow in the traditional four way d...

Robust ℋ2 static output feedback to control an automotive throttle valve

The paper presents a control strategy for an automotive electronic throttle body, a device widely used in vehicles to increase the efficiency of combustion engines. The synthesis of the proposed controller is based on a linear matrix inequality (LMI) formulation, which allows us to deal with uncertainties in the measurements of the throttle valve position. The LMI approach generat...

Learning Deep Neural Network Control Policies for Agile Off-Road Autonomous Driving

We present an end-to-end learning system for agile, off-road autonomous driving using only low-cost on-board sensors. By imitating an optimal controller, we train a deep neural network control policy to map raw, high-dimensional observations to continuous steering and throttle commands, the latter of which is essential to successfully drive on varied terrain at high speed. Compared with recent ...

IMECE2005-81376 Software Enabled Variable Displacement Pumps

Direct pump control of hydraulic systems is more energy efficient than throttle valve based methods to control hydraulic systems. This requires variable displacement pumps that are responsive and capable of electronic control. Such Electronic Displacement Controlled (EDC) pumps tend to be significantly larger, heavier and more expensive than fixed displacement counterparts. In addition, achieva...

Asymmetric Modelling and Control of an Electronic Throttle

This paper presents an improved model for an automotive electronic throttle, inspired by the behavior observed in real-time experiments. Due to a number of issues, particularly the return spring, the performance of the throttle valve depends on whether it is opening or closing. This asymmetric behavior was taken into account to design a mathematical model of the throttle body and to derive a non...

Publication date: 2013